Multifunction Thesaurus For Russian Word Processing
نویسنده
چکیده
A new type of thesaurus for word processing is proposed. It comprises 7 semantic and 8 syntagmatic types of links between Russian words and collocations. The original version now includes ca. 76,000 basic dictionary entries, 660,000 semantic and 292,000 syntagmatic links, English interface, and communication with any text editor. Methods of delivery enriching are used based on generic and synonymous links.
منابع مشابه
Evaluation experiments on related terms search in Wikipedia: Information Content and Adapted HITS (In Russian)
The classification of metrics and algorithms search for related terms via WordNet, Roget’s Thesaurus, and Wikipedia was extended to include adapted HITS algorithm. Evaluation experiments on Information Content and adapted HITS algorithm are described. The test collection of Russian word pairs with human-assigned similarity judgments is proposed.
متن کاملWord Association Thesaurus As a Resource for Building WordNet
The goal of the present paper is to report on the on-going research for applying psycholinguistic resources to building a WordNet-like lexicon of the Russian language. We are to survey different kinds of the linguistic data that can be extracted from a Word Association Thesaurus, a resource representing the results of a largescaled free association test. In addition, we will give a comparison o...
متن کاملSociopolitical Thesaurus in Concept-based Information Retrieval
In CLEF2005 experiments we used bilingual Russian-English Sociopolitical thesaurus that we constructed for more than 10 years specially as a tool for automatic text processing in information-retrieval tasks. The same resource and the same algorithm were used for ad-hoc and domain –specific tasks.
متن کاملRuThes Linguistic Ontology vs. Russian Wordnets
The paper describes the structure and current state of RuThes – thesaurus of Russian language, constructed as a linguistic ontology. We compare RuThes structure with the WordNet structure, describe principles for inclusion of multiword expressions, types of relations, experiments and applications based on RuThes. For a long time RuThes has been developed within various NLP and informationretrie...
متن کاملارائه روشی برای استخراج کلمات کلیدی و وزندهی کلمات برای بهبود طبقهبندی متون فارسی
Due to ever-increasing information expansion and existing huge amount of unstructured documents, usage of keywords plays a very important role in information retrieval. Because of a manually-extraction of keywords faces various challenges, their automated extraction seems inevitable. In this research, it has been tried to use a thesaurus, (a structured word-net) to automatically extract them. A...
متن کامل